sketch size
- Asia > Middle East > Saudi Arabia (0.04)
- North America > United States > Connecticut > New Haven County > New Haven (0.04)
- North America > Canada (0.04)
- (5 more...)
- Africa > Senegal > Kolda Region > Kolda (0.05)
- North America > United States > Illinois (0.04)
- Asia > Afghanistan > Parwan Province > Charikar (0.04)
Communication-efficientDistributedSGDwith Sketching
However,theoretical and empirical evidence both suggest that there is a maximum mini-batch size beyond which the number of iterations required toconvergestops decreasing, andgeneralization error begins toincrease [Maetal.,2017,Lietal., 2014, Golmant et al., 2018, Shallue et al., 2018, Keskar et al., 2016, Hoffer et al., 2017]. In this paper, we aim instead to decrease the communication cost per worker.
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)
- Europe > Sweden > Stockholm > Stockholm (0.04)
- North America > United States (0.04)
- Asia > Afghanistan > Parwan Province > Charikar (0.04)
- Information Technology > Security & Privacy (0.93)
- Information Technology > Data Science > Data Mining (0.68)
- Information Technology > Communications (0.68)
- Information Technology > Artificial Intelligence > Machine Learning (0.46)
- North America > United States (0.14)
- Asia > Afghanistan > Parwan Province > Charikar (0.04)
- Africa > Senegal > Kolda Region > Kolda (0.05)
- North America > United States > Illinois (0.04)
Cost-efficient Gaussian tensor network embeddings for tensor-structured inputs
This work discusses tensor network embeddings, which are random matrices ($S$) with tensor network structure. These embeddings have been used to perform dimensionality reduction of tensor network structured inputs $x$ and accelerate applications such as tensor decomposition and kernel regression. Existing works have designed embeddings for inputs $x$ with specific structures, such as the Kronecker product or Khatri-Rao product, such that the computational cost for calculating $Sx$ is efficient. We provide a systematic way to design tensor network embeddings consisting of Gaussian random tensors, such that for inputs with more general tensor network structures, both the sketch size (row size of $S$) and the sketching computational cost are low.We analyze general tensor network embeddings that can be reduced to a sequence of sketching matrices. We provide a sufficient condition to quantify the accuracy of such embeddings and derive sketching asymptotic cost lower bounds using embeddings that satisfy this condition and have a sketch size lower than any input dimension.
- North America > United States > New York (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)